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SEQUENCE MISTING 



(1) GENERAL INFORMATION: 



(i) APPLICANT: 

(A) NAME: The Scripps 

(B) STREET: 10666 Nort 

Mail Drop TPC8 

(C) CITY: La Jolla 

(D) STATE: CA 

(E) COUNTRY: USA 

(F) POSTAL CODE (Zl 

(G) TELEPHONE: 619- 

(H) TELEFAX: 61 9-! 



jsearch Institute 
Torrey Pines Road, Suite 220, 



92037 
54-2937 
-6312 



(ii) TITLE OF INVENTION, 
LIGATION ^ 



SYNTHESIS OF PROTEINS BY NATIVE CHEMICAL 



(iii) NUMBER OF SEQUENCES': 20 



(iv) COMPUTER READ 

(A) MEDIUM TYP, 

(B) COMPUTER: 

(C) OPERATING 

(D) SOFTWARE/ PJ 



BLE FOF 
Flgppy disk 

C compatible 
TEM^PC-DOS/MS-DOS 

ri Release #1.0, Version #1.25 (EPO) 



(v) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/US 

(B) FILING DATE: 04-MAY-1995 



(2) INFORMATIONypOR SEQ (D NO:1: 

(i) SEQUENCE/CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE/ amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLEOULE TYPE: peptide 



(ix) FEATURE: 
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(A) NAME/KEY: Modified-sfte 

(B) LOCATION: 5 

(D) OTHER INFORMATION^ /label = COSH 
/note= "Wherein COSH is thioacid." 



PCT/US95/05668 



(xi) SEQUENCE DESCRIPTION SEQ ID NO:1: 

Leu Tyr Arg Ala Gly 
1 5 

(2) INFORMATION FOR SEQ it) NO:2: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amintf acids 

(B) TYPE: amino acic 

(C) STRANDEJ3NET3S: jingle 

(D) TOPOLOGY: liAear' 

(ii) MOLECULE TYPE: (iptide^ 



(xi) SEQUENCE DESQRIF 



v. SEQ ID NO:2: 



Cys Arg Ala Glu Tytf Ser 
1 5 

(2) INFORMATION FOfc SEQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: p amino acids 

(B) TYPE: arrfino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE JTYPE: peptide 



(ix) FEATURE} 

(A) NAME/KEY: Modified-site 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /label = COSBn 

/note = "Wherein COSBn is benzyl thioester. 
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(xi) SEQUENCE DESCRIPTION: SEQ ID 

Leu Tyr Arg Ala Gly 
1 5 

(2) INFORMATION FOR SEQ ID N0:4: 

(i) SEQUENCE CHARACTERISTICS} 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Modtfrf 

(B) LOCATION: 5 

(D) OTHER INFORMA"fflONy /label = X 

/note= "Wherein b< isNtacetyl-cysteine-thioester." 



(xi) SEQUENCE DESCRIPTllRN: SEQ ID NO:4: 

Leu Tyr Arg Ala Gly 
1 5 

(2) INFORMATION FORjSEQ ID NO:5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: yl amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 

Leu Tyr Ara Ala Gly Cys Arg Ala Glu Tyr Ser 
1 J 5 10 
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(2) INFORMATION FOR SEQ ID NO|6: 

(i) SEQUENCE CHARACTERIST/CS: 

(A) LENGTH: 5 amino acid: 

(B) TYPE: amino acid 

(C) STRANDEDNESS: singj 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide) 



(ix) FEATURE: 

(A) NAME/KEY: Modifier-site 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /label = SCH2COOH 

/note= "Wherein/SCH2COOH is 2-thioacetic acid. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 

Leu Tyr Arg Ala Gly' 
1 5 

(2) INFORMATION FOR SEjfo IDM):\7: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 33 acWiho ^elds 
IB) TYPE: amino acidi 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE/ peptide 



(ix) FEATURE: / 

(A) NAME/KEY/ Modified-site 

(B) LOCATION) 33 

(D) OTHER INFORMATION: /label = COSH 
/note= "M/herein COSH is thioacid." 

(ix) FEATURE: / 

(A) NAME/KEY: Modified-site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /label = Msc 
/notep "Wherein Msc is 
2-me/hyl-sulfonyl-ethyloxy-carbonyl." 
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(xi) SEQUENCE DESCRIPTION: SEQ ID 

Ser Ala Lys Glu Leu Arg Cys Gin Cys life Lys Thr Tyr Ser Lys Pro 
1 5 10/15 

Phe His Pro Lys Phe lie Lys Glu Leu Arg Val He Glu Ser Gly Pro 
20 25 / 30 



Ala 



(2) INFORMATION FOR SEQ ID NO: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peA\di 



(ix) FEATURE: 

(A) NAME/KEY: MoqM 

(B) LOCATION: 33 
(D) OTHER INFORM/ 

/note= "Whereft 



(ON: /la^el= COSBn 
COSBn is benzyl thioester. 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 

Ser Ala Lys Glu Leu Arg Cys Gin Cys He Lys Thr Tyr Ser Lys Pro 
1 5/10 15 

Phe His Pro Lys Phe He Lys Glu Leu Arg Val He Glu Ser Gly Pro 
20 / 25 30 



Ala 



(2) INFORMATION FOR SEQ ID NO:9: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 amino acids 

(B) TYRE: amino acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 

Cys Ala Asn Thr Glu He He Val Lys Leu Ser Asp Gly Arg Glu Leu 
1 5 10/ 15 

Cys Leu Asp Pro Lys Glu Asn/Trp Val Gin Arg Val Val Glu Lys Phe 
20 25 / 30 

Leu Lys Arg Ala Glu Asn Set; 
35 

(2) INFORMATION FOR SEQ 10 NO: 10: 



(i) SEQUENCE CHAR> 

(A) LENGTH: 72 

(B) TYPE: amino 

(C) STRANDEDNE 

(D) TOPOLOGY: Ii] 

(ii) MOLECULE TYPE: 

(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 
(D) OTHER INFQI 



USTICS: 
icids 

singRj 

r 

/ 
ptide 

itiea^site 
NATION: /label = SH4 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Ser Ala Lys Glu l/eu Arg Cys Gin Cys He Lys Thr Tyr Ser Lys Pro 
1 5/10 15 

Phe His Pro Lys/ Phe He Lys Glu Leu Arg Val He Glu Ser Gly Pro 
20 / 25 30 

Ala Cys Ala Alsn Thr Glu He He Val Lys Leu Ser Asp Gly Arg Glu 
35 / 40 45 
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Leu Cys Leu Asp Pro Lys Glu Asn Trp Val Gln/\rg Val Val Glu Lys 
50 55 60 

Phe Leu Lys Arg Ala Glu Asn Ser 
65 70 

(2) INFORMATION FOR SEQ ID NO:1 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Modified-site 

(B) LOCATION: 40 

(D) OTHER INFORMATIOlM/label* COSNB 

/note = "Wherein COtfNB is 5/-thio-2-nitro-benzoic 
acid ester." 



(ix) FEATURE: 

(A) NAME/KEY: Modifiers 

(B) LOCATION: 27 
ID) OTHER INFORMATI, 

/note= "Wherein 




/label = Xga 

is 2-Amjnobutyric acid. 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1 1: 

Pro Gin lie Thr Leu Trpi.ys Arg Pro Leu Val Thr He Arg lie Gly 
1 5 / 10 15 

Gly Gin Leu Lys Glu /(la Leu Leu Asp Thr Gly Ala Asp Asp Thr Val 
20 / 25 30 

He Glu Glu Met Asa Leu Pro Gly 
35 / 40 



(2) INFORMATION F/OR SEQ ID NO: 12: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 59 amino acitfs 

(B) TYPE: amino acid 

(C) STRANDEDNESS: singfe 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide] 



(ix) FEATURE: 

(A) NAME/KEY: Modifier-site 

(B) LOCATION: 27 

(D) OTHER INFORMATION: /label - Xaa 

/note = "Wherein ^<aa is 2-Aminobutyric acid. 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site 

(B) LOCATION: 55 

(D) OTHER INFORMATION: /label = Xaa 

/note= "Vtffoe^e/nXaa is 2-Aminobutyric acid. 



(xi) SEQUENCE DESCRIpTIOpd: SEQ ID NO: 12: 
Cys Trp Lys Pro Lys Met He Gly^Gly lie Gly Gly Phe lie Lys Val 



1 



Arg Gin Tyr Asp Gli 
20 

Gly Thr Val Leu Val) 
35 



10 



15 



e/Pro Val Gib lie Xaa Gly His Lys Ala lie 
25 / 30 

y Pro Trypro Val Asn He He Gly Arg Asn 
45 



Leu Leu Thr Gin lle/Gly Xaa Thr Leu Asn Phe 
50 J55 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH :/40 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE/TYPE: peptide 
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(ix) FEATURE: 

(A) NAME/KEY: Modified-sfte 

(B) LOCATION: 40 

(D) OTHER INFORMATION: /label = COSBn 
/note= "Wherein CQSBn is ??.' 

(ix) FEATURE: 

(A) NAME/KEY: Modifier-site 

(B) LOCATION: 40 

(D) OTHER INFORMATION: /label = COSBn 

/note= "Wherein/COSBn is benzyl thio ester." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Pro Gin He Thr Leu Trp llys Arg Pro Leu Val Thr lie Arg He Gly 
1 5 / 10 15 

Gly Gin Leu Lys Glu AlL-Leu Leu Asp Thr Gly Ala Asp Asp Thr Val 



20 

He Glu Glu Met Asr 
35 



30 



ju Pro'Gly 
/40 



(2) INFORMATION FOP/$EQ ID NO: 14: 

.CTERISTICS: 
min 
dcid 

iear 

(ii) MOLECULE TYPE: peptide 



(i) SEQUENCE CH 

(A) LENGTH: 4CVafm«no acid 

(B) TYPE: amir/o 

(C) STRANDEBN 

(D) TOPOLOGV: li 



le 



(ix) FEATURE: 

(A) NAME/KEY: Modified-site 

(B) LOCATION: 40 

(D) OTHER INFORMATION: /label = COSPh 

/note = "Wherein COSPh is phenyl thioester. 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 



Pro Gin lle/Thr Leu Trp Lys Arg Pro Leu Val Thr He Arg He Gly 
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1 



Gly Gin Leu Lys Glu Ala Leu Leu Asp Thr G/ly Ala Asp Asp Thr Val 
20 25 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 99 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIP 

Pro Gin lie Thr Leu Trp JLy 
1 5 




(ix) FEATURE: 

(A) NAME/KEY: Modified-sit<= 

(B) LOCATION: 67 

(D) OTHER INFORMATION://label = Xaa 

/note= "Wherein Xaa/js amino butyric acid." 



(ix) FEATURE: 

(A) NAME/KEY: Modifftd/site, 

(B) LOCATION: 95 
(D) OTHER INFORMA 

/note= "Whereiih 



= Xaa 
inobutyric acid." 



N0:15: 

Leu Val Thr lie Arg He Gly 
15 



Gly Gin Leu Lys Glu Afla Leu' Leu Asp Thr Gly Ala Asp Asp Thr Val 
20 / 25 30 

He Glu Glu Met Asn/Leu Pro Gly Cys Trp Lys Pro Lys Met He Gly 
35 / 40 45 

Gly lie Gly Gly Ph^ He Lys Val Arg Gin Tyr Asp Gin He Pro Val 
50 / 55 60 



Glu He Xaa Gly /His Lys Ala He Gly Thr Val Leu Val Gly Pro Thr 
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Ala Gin Val lie Asn Thr 
1 5 

Tyr His Lys Leu Pro A 

20 / 2 




Pro Val Asn He He Gly Arg Asn Leu Leu Thr Glr/lle Gly Xaa Thr 
85 90 95> 

Leu Asn Phe 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Modified-sit^ 

(B) LOCATION: 48 

(D) OTHER INFORMATfON/>Jabel= COSNB 

/note = "Whersi/i COjSNp is 5-thio-2-nitro benzoic 
acid ester." 



(xi) SEQUENCE DESCRPTjON: SEQ Id NO: 16 



'Asp Gly val Ala Asp Tyr Leu Gin Thr 
10 / 15 

sp Tyf/He Thr Lys Ser Glu Ala Gin Ala 
30 



Leu Gly Trp Val Ala Ser Lys Gly Asn Leu Ala Asp Val Ala Pro Gly 
35 / 40 45 



(2) INFORMATION FOR SEQ ID NO: 17: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 amino acids 

(B) TYPE:/amino acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID/NO: 17: 



Cys Ser He Gly Gly Asp He Phe Ser 
1 5 10 



Hsn Arg Glu Gly Lys Leu Pro 
15 



Gly Lys Ser Gly Arg Thr Trp Arg Glu Ala Asp lie Asn Tyr Thr Ser 
20 25 / 30 

Gly Phe Arg Asn Ser Asp Arg He/Leu Tyr Ser Ser Asp Trp Leu He 
35 40 / 45 

Tyr Lys Thr Thr Asp His Tyr Gfn Thr Phe Thr Lys lie Arg 
50 55 / 60 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 amino/acids 

(B) TYPE: amino 

(C) STRANDEDNE$j6:/sjfogle 

(D) TOPOLOGY: lir 



(ii) MOLECULE TYPE: 



Dtide 



(ix) FEATURE: 

(A) NAME/KEY: Modjfied-s 

(B) LOCATION: 48 
(D) OTHER INFORMATION: /label = COSBn 

/note = "Wherein COSBn is benzyl thio ester. 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18: 

Ala Gin Val He Asn Thr Phe Asp Gly Val Ala Asp Tyr Leu Gin Thr 
1 5 / 10 15 

Tyr His Lys Lt£u Pro Asn Asp Tyr He Thr Lys Ser Glu Ala Gin Ala 
20 / 25 30 
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Leu Gly Trp Val Ala Ser Lys Gly Asn Leu Ala As£ Val Ala Pro Gly 
35 40 45 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Modified-site 

(B) LOCATION: 48 

(D) OTHER INFORMATION: /label = COSPh 

/note= "Wherein COSBh is phenyl thio ester." 




(xi) SEQUENCE DESCRIPTIO 



Q ID NO: 19: 



Ala Gin Val lie Asn Thr PI 
1 5 

Tyr His Lys Leu Pro Asn 
20 



Leu Gly Trp Val Ala Sei 
35 



y Val Ala Asp Tyr Leu Gin Thr 
15 



hr Lys Ser Glu Ala Gin Ala 
30 



Ala Asp Val Ala Pro Gly 



(2) INFORMATION FOR/SEQ ID NO:20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: yl 0 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE/TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:/0: 



Ala Gin Val lie Asn Thr Phe Asp Gly 
1 5 10 



Cys Ser He Gly Gly Asp 
50 55 



Gly Lys Ser Gly Arg 
65 70. 




Asp Tyr Leu Gin Thr 
15 



Tyr His Lys Leu Pro Asn Asp Tyr lief Thr Lys Ser Glu Ala Gin Ala 
20 25 / 30 



Leu Gly Trp Val Ala Se 
35 



n Leu Ala Asp Val Ala Pro Gly 
45 



sn Arg Glu Gly Lys Leu Pro 



la Asp lie Asn Tyr Thr Ser 
80 



Gly Phe Arg Astr Ser Asp Arg He Leu Tyr Ser Ser Asp Trp Leu He 
85/ 90 95 



Tyr Lys Tiythr Asp His Tyr Gin Thr Phe Thr Lys He Arg 
100 105 110 



